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Abstract 

Metric graphs are ubiquitous in science and engineering. For example, many data are drawn from 
hidden spaces that are graph-like, such as the cosmic web. A metric graph offers one of the simplest 
yet still meaningful ways to represent the non-linear structure hidden behind the data. In this paper, we 
propose a new distance between two finite metric graphs, called the persistence-distortion distance, which 
draws upon a topological idea. This topological perspective along with the metric space viewpoint provide 
a new angle to the graph matching problem. Our persistence-distortion distance has two properties not 
shared by previous methods: First, it is stable against the perturbations of the input graph metrics. Second, 
it is a continuous distance measure, in the sense that it is defined on an alignment of the underlying spaces 
of input graphs, instead of merely their nodes. This makes our persistence-distortion distance robust 
against, for example, different discretizations of the same underlying graph. 

Despite considering the input graphs as continuous spaces, that is, taking all points into account, we 
show that we can compute the persistence-distortion distance in polynomial time. The time complexity for 
the discrete case where only graph nodes are considered is much faster. We also provide some preliminary 
experimental results to demonstrate the use of the new distance measure. 


1 Introduction 

Many data in science and engineering are drawn from a hidden space which are graph-like, such as the 
cosmic web [27] and road networks [1, 5]. Furthermore, as modern data becomes increasingly complex, 
understanding them with a simple yet still meaningful structure becomes important. Metric graphs equipped 
with a metric derived from the data can provide such a simple structure [17, 26]. They are graphs where 
each edge is associated with a length inducing the metric of shortest path distance. The comparison of the 
repersentative metric graphs can benefit classification of data, a fundamental task in processing them. This 
motivates the study of metric graphs in the context of matching or comparison. 

To compare two objects, one needs a notion of distance in the space where the objects are coming from. 
Various distance measures for graphs and their metric versions have been proposed in the literature with 
associated matching algorithms. We approach this problem with two new perspectives: (i) We aim to develop 
a distance measure which is both meaningful and stable against metric perturbations, and at the same time 
amenable to polynomial time computations, (ii) Unlike most previous distance measures which arc discrete 
in the sense that only graph node alignments are considered, we aim for a distance measure that is continuous, 
that is, alignment for all points in the underlying space of the metric graphs are considered. 

Related work. To date, the large number of proposed graph matching algorithms fall into two broad 
categories: exact graph matching methods and inexact graph matching (distances between graphs) methods. 
The exact graph matching, also called the graph isomorphism problem, checks whether there is a bijection 
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between the node sets of two input graphs that also induces a bijection in their edge sets. While polynomial 
time algorithms exist for many special cases, e.g., [2, 20, 24], for general graphs, it is not known whether the 
graph isomorphism problem is NP complete or not [16]. Nevertheless, given the importance of this problem, 
there are various exact graph matching algorithms developed in practice. Usually, these methods employ 
some pruning techniques aiming to reduce the search space for identifying graph isomorphisms. See [14] for 
comparisons of various graph isomorphism testing methods. 

In real world applications, input graphs often suffer from noise and deformation, and it is highly desirable 
to obtain a distance between two input graphs beyond the binary decision of whether they are the same 
(isomorphic) or not. This is referred to as inexact graph matching in the held of pattern recognition, and 
various distance measures have been proposed. One line of work is based on graph edit distance which is 
NP-hard to compute [31]. Many heuristic methods, using for example A* algorithms, have been proposed to 
address the issue of high computational complexity, see the survey [15] and references within. One of the 
main challenges in comparing two graphs is to determine how ’’good” a given alignment of graph nodes is in 
terms of the quality of the pairwise relations between those nodes. Hence matching two graphs naturally 
leads to an integer quadratic programming problem (IQP), which is a NP-hard problem. Several heuristic 
methods have been proposed to approach this optimization problem, such as the annealing approach of [18], 
iterative methods of [23, 29] and probabilistic approach in [30]. Finally, there have been several methods 
that formulate the optimization problem based on spectral properties of graphs. For example, in [28], the 
author uses the eigendecomposition of adjacency matrices of the input graphs to derive an expression of 
an orthogonal matrix which optimizes the objective function. In [9, 22], the principal eigenvector of a 
“compatibility” matrix of the input graphs is used to obtain correspondences between input graph nodes. 
Recently in [21], Hu et. al proposed the general and descriptive Laplacian family signatures to build the 
compatibility matrix and model the graph matching problem as an integer quadratic program. 

New work. Different from previous approaches, we view input graphs as continuous metric spaces. Intu¬ 
itively, we assume that our input is a finite graph G = ( V. E) where each edge is assigned a positive length 
value. We now consider G as a metric space (|Gj, dc) on the underlying space \G\ of G, with metric dc: 
being the shortest path metric in |G|. Given two metric graphs Gj and Go, a natural way to measure their 
distance is to use the so-called Gromov-Hausdorff distance [19, 25] to measure the metric distortion between 
these two metric spaces. Unfortunately, it is NP-hard to even approximate the Gromov-Hausdorff distance for 
graphs within a constant factor 1 . Instead, we propose a new metric, called the persistence-distortion distance 
dpnfG'i. Go), which draws upon a topological idea and is computable in polynomial time with techniques 
from computational geometry. This provides a new angle to the graph comparison problem, and our distance 
has several nice properties: 

(1) The persistence-distortion distance takes all points in the input graphs into account, while " 
all previous graph matching algorithms align only graph nodes. Hence our persistence-distortion 
distance is insensitive to different discretization of the same graph: For example, the two geometric 
graphs on the right are equivalent as metric graphs, and thus the persistence-distortion distance 
between them is zero. 

(2) In Section 3, we show that our persistence-distortion dis¬ 
tance dpofGj ■ Gf) is stable w.r.t. changes to input metric graphs 
as measured by the Gromov-Hausdorff distance. For example, the 
two geometric graphs on the right have small persistence-distortion 
distance. (Imagine that they are the reconstructed road networks 
from noisy data sampled from the same road systems.) 

'This result has been very recently obtained by two groups of researchers independently: Agarwal, Fox and Nath from Duke 
Univ., and Sidiropoulos and Wang from Ohio State Univ. 
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(3) Despite that our persistence-distortion distance is a continuous measure which considers all points in 
the input graphs, we show in Section 5 that it can be computed in polynomial time (0(m 12 logm) where 
m is the total complexity of input graphs). We note that the discrete version of our persistence-distortion 
distance, where only graph nodes are considered (much like in previous graph matching algorithms), can be 
computed much more efficiently in 0(n 2 m 15 log rn) time, where n is the number of graph nodes in input 
graphs. 

Finally, we also provide some preliminary experimental results to demonstrate the use of the persistence- 
distortion distance. 

2 Notations and Proposed Distance Measure for Graphs 

Metric graphs. A metric graph is a metric space (M, d) where M is the underlying space of a finite 
1-dimensional simplicial complex. Given a graph G = ( V. E) and a weight function Len : E —> M + on its 
edge set E (assigning length to edges in E), we can associate a metric graph (\G\ , do) to it as follows. The 
space | G | is a geometric realization of G. Let \e\ denote the image of an edge e E E in |G|. To define the 
metric da, we consider the arclength parameterization e : [0, Len(e)] —> |e| for every edge e E E and define 
the distance between any two points x, y E |e| as da(x, y ) = |e _ 1 (y) — e _ 1 (at)|. This in turn provides the 
length of a path ir(z , w ) between two points z,w E |G| that are not necessarily on the same edge in |G|, by 
simply summing up the lengths of the restrictions of this path to edges in G. Finally, given any two points 
z,w E \G\. the distance da(z, w) is given by the minimum length of any path connecting z to w in \G\. 

In what follows, we do not distinguish between | • | and its argument and write (G. da) to denote the metric 
graph (\G\,da) for simplicity. Furthermore, for simplicity in presentation, we abuse the notations slightly 
and refer to the metric graph as G = (V, E), with the understanding that (V, E) refers to the topological 
graph behind the metric space (G, da). Finally, we refer to any point x € G as a point, while a point x E V 
as a graph node. 

Background on persistent homology. The definition of our proposed distance measure for two metric 
graphs relies on the so-called persistence diagram induced by a scalar function. We refer the readers to 
resources such as [11, 12] for formal discussions on persistent homology and related developments. Below 
we only provide an intuitive and informal description of the persistent homology induced by a function under 
our simple setting. 

Let / : X —> 1R be a continuous real-valued function defined on a topological space X. We want to 
understand the structure of X from the perspective of the scalar function /: Specifically, let X a := {x E X 
f(x) > a} denote the super-level set 2 of X w.r.t. a E Hi. Now as we sweep X top-down by decreasing the 
a value, the sequence of super-level sets connected by natural inclusion maps gives rise to a filtration of X 
induced by f: 

ricr 2 C...CI“” = I, for <*1 > a 2 > > a m . (1) 

We track how the topological features captured by the so-called homology classes of the super-level sets 
change. In particular, as a decreases, sometimes new topological features are “born” at time a, that is, new 
families of homology classes are created in H/.(X a ). the k- th homology group of X a . Sometimes, existing 
topological features disappear, i.e, some homology classes become trivial in H/-(X^) for some (3 < a. The 
persistent homology captures such birth and death events, and summarizes them in the so-called persistence 
diagram Dg fc (/). Specifically, Dg k (f) consists of a set of points {(a, /3) E 1R 2 } in the plane, where each 
( a , j3) indicates a homological feature created at time a and killed at time (3. 

2 In the standard formulation of persistent homology of a scalar field, the sub-level set X a = {x G X \ f[x) < a} is often used. 
We use super-level sets which suit the specific functions that we use. 


3 



In our setting, the domain X will be the underlying space of a metric graph G. The specific function 
that we use later is the geodesic distance to a fixed basepoint s £ G, that is, we consider / : G —> 1R where 
f(x) = d(j{s, x ) for any x G G. We are only interested in the Oth-dimensional persistent homology (k = 0 
in the above description), which simply tracks the connected components in the super-level set as we vary a. 





(a) 


(b) 


(c) 


(d) 


Figure 1: (a) A graph with basepoint s: edge length is marked for each edge, (b) The function / = (lc(s. •). 
We also indicate critical-pairs, (c) Persistence diagram Dg 0 f: E.g, the persistence-point (6, 5) is generated 
by critical-pair (u, t':j). (d) shows a partial matching between the red points and blue points (representing two 
persistence diagrams). Some points are matched to the diagonal L. 


Figure 1 gives an example of the 0-th persistence diagram Dg 0 (/) with the basepoint s in edge (vq. v\ ). 
As we sweep the graph top-down in terms of the geodesic function /, a new connected component is created 
as we pass through a local maximum Ub of the function / = dais. •). A local maximum of /, such as u in 
Figure 1 (b), is not necessarily a graph node from V. Two connected components in the super-level set can 
only merge at an up-fork saddle <x c j of the function /: The up-fork saddle is a point such that within a 
sufficiently small neighborhood of u^, there are at least two branches incident on u<± with function values 
larger than . Each point (b, d) in the persistence diagram is called a persistence point, corresponding to 
the creation and death of some connected component: At time b, a new component is created in X h at a 
local maximum Ub G G with f{ub) = b. At time d and at an up-fork saddle <G G with f(u d) = d, this 
component merges with another component created earlier. We refer to the pair of points (ub, uf) from the 
graph G as the critical-pair corresponding to persistent point (b, d). We call b and d the birth-time and 
death-time, respectively. The plane containing the persistence diagram is called the birth-death plane. 

Finally, given two finite persistence diagrams Dg = {pi,... ,pi G 1R 2 } and Dg 7 = {q\..... (p. G 1R 2 }, a 
common distance measure for them, the bottleneck distance dg(Dg, Dg 7 ) [6], is defined as follows: Consider 
Dg and Dg 7 as two finite sets of points in the plane (where points may overlap). Call L = {(x,x) G 1R 2 } the 
diagonal of the birth-death plane. 

Definition 1 A partial matching C of Dg and Dg 7 is a relation C : (Dg IJ L) x (Dg 7 U L) such that each 
point in Dg is either matched to a unique point in Dg 7 , or mapped to its closest point (under L^-norm) in 
the diagonal L; and the same holds for points in Dg 7 . See Figure 1(d). The bottleneck distance is defined 
as ds{ Dg, Dg 7 ) = mine max (p,<j)eC II P ~ hWoc, where C ranges over all possible partial matchings of Dg 
and Dg 7 . We call the parital matching that achieves the bottleneck distance c//j(I)g. Dg 7 ) as the bottleneck 
matching. 


Proposed persistence-distortion distance for metric graphs. Suppose we are given two metric graphs 
(Gi,cZgi) and (G 2 ,d G f). Eet (Vi,E\) and (V 2 , E 2 ) denote the node set and edge set for Gi and G 2 , 
respectively. Set n = max{|Vi|, | V 2 I} and m = max{|£’i|, 

Choose any point s 6 Gj as the base point, and consider the shortest path distance function dc 1 , s : Gi —> 
1R defined as dc 1 , s (x) = dc , (s, x) for any point x £ G 1 . Eet P s denote the 0-th dimensional persistence 
diagram Dg 0 (dG liS ) induced by the function dc;, ,s- Define da 2 .t. an d Qt similarly for any base point t G G 2 
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for the graph G 2 . We map the graph Gi to the set of (infinite number of) points in the space of persistence 
diagrams ID), denoted by C := {P s | s G Gi}. Similarly, map the graph G2 to T := {Qt | t G G2}. 

Definition 2 The persistence-distortion distance between Gi and G 2 , denoted by dpofG 1 , G 2 ), is the 
Hausdorff distance du(C. J- ) between the two sets C and T where the distance between two persistence 
diagrams is measured by the bottleneck distance. In other words, 

dpn (Gi, G2) = drj(C, J 7 ) = max{ max min dnCP, Q), max min dnlP, Q) |. 

Pec QeT QeJ- Pec 







Figure 2: In the top row, we show three components A, B and C, that will be used to construct the two 
input metric graphs G\ and G 2 in the bottom row. All edges are of length 1. Each of these compnent has 
the property, that from a basepoint outside these components, their persistence diagram w.r.t. the geodesic 
distance remains the same (and hence cannot be distinguished). Graphs G\ and Go are not isomorphic. 
However, they are mapped to the same image set in the space of persistence diagrams, and hence the 
persistence-distortion distance between them is zero. 


Remark. (1) We note that if two graphs are isomotphic, then dpD(Gi, G 2 ) = 0. The inverse unfortunately 
is not true. (See Figure 2 for an example where two graphs have dpo(Gi, G 2 ) = 0, but they are not 
isomorphic.) Hence dpp is a pseudo-metric (it inherits the triangle-inequality property from the Hausdorff 
distance). (2) While the above definition uses only the 0-th persistence diagram for the geodesic distance 
functions, all our results hold with the same time complexity when we also include the 1 st-extended persistence 
diagram [7] or equivalently lst-interval persistence diagram [10] for each geodesic distance function dc 1:S 
(resp. d G2 , t). 


3 Stability of persistence-distortion distance 

Gromov-Hausdorff distance. There is a natural way to measure metric distortion between metric spaces 
(thus for metric graphs), called the Gromov-Hausdorff distance [19,4]. Given two metric spaces X = (X , dx) 
and y = ( Y, dy ), a correspondance between X and y is a relation M : X x Y such that (i) for any x G X, 
there exists (x, y) G M and (ii) for any if G Y, there exists (x 1 , if ) G M.. The Gromov-Hausdorff distance 
between X and y is 


dGH {X,y)=~ inf max | d x (xi, x 2 ) - dy (yi, 2 / 2 )|, (2) 
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where Af ranges over all correspondences of X x Y. The Gromov-Hausdorff distance is a natural measure¬ 
ment for distance between two metric spaces; see [25] for more discussions. Unfortunately, so far, there is 
no efficient (polynomial-time) algorithm to compute nor approximate this distance, even for special metric 
spaces - In fact, it has been recently shown that even the discrete Gromov-Hausdorff distance for metric trees 
(where only tree nodes are considered) is NP-hard to compute, as well as to approximate within a constant 
factor (recall footnote 1). In contrast, as we show in Section 4 and 5, the persistence-distortion distance can 
be computed in polynomial time. 

On the other hand, we have the following stability result, which intuitively suggests that the persistence- 
distortion distance is a weaker relaxation of the Gromov-Hausdorff distance. The proof of this theorem 
leverages a recent result on measuring distances between the Reeb graphs [3] and can be found in the full 
version. 

Theorem 3 (Stability) dpD(Gi, G 2 ) < 6dGH(Gi, G 2 ). 

By triangle inequality, this also implies that given two metric graphs Gi and G 2 and their perturbations 
G^ and G 2 , respectively, we have that: 

dpoic;, G' 2 ) < d PD (Gi, G 2 ) + 6d GH (Gi, Gi) + 6d GH (G 2 , G' 2 ). 


Proof of Theorem 3. The remainder of this section devotes to the proof of the above theorem. 

Given two input metric graphs (Gi,cZgi) an d (G 2 ,dc 2 ), set 6 = dcH(Gi,G 2 ) to be the Gromov- 
Hausdorff distance between G\ and C 2 . Assume that the correspondence At * achieves this metric distortion 
distance 5 3 . Now for any point s £ Gi, there must exist t £ G 2 such that (s,t) £ Ai*. We will now 
show that ds( P S5 Qt) < 65. Symmetrically, we show that for any t £ G 2 , there is (s, t) £ Ai* such that 
ds( P s > Qt) < 65. Since such an t can be found for any point s £ Gi, and symmetrically, such an s can be 
found for any t £ G 2 , it then follows that the Hausdorff distance between C and T is bounded from above by 
65, proving the theorem. 

We will prove dB(P s ,Qt) < 65 for (s, t) £ At * with the help of another distance, the so-called 
functional-distortion distance between two Reeb graphs recently introduced in [3]. We recall its definition 
below. 

The functional-distortion distance is defined between two graphs Gi and G 2 , with a function / : Gi —>• IR 
and g : G 2 —> IR defined on each of them, respectively. First, we define the following (pseudo-)metrie 
on the input graphs as induced by / and g, respectively. (Note, that these metrics are different from the 
path-length distance metrics dc 1 and dc 2 that input graphs already come with.) Specifically, given two points 
x\,X 2 £ Gi, define df(x 1 , x 2 ) = min 7r:a . 1 ^ X2 height(7r), where ir ranges over all paths in Gi from x\ to x 2 , 
and height(-7r) = max xe7r fix) — min xe7r fix) is the maximum /-function value difference for points from 
the path tv. Define the metric d g for G 2 similarly. 

Now given two continuous maps </>_>. : Gi —> G 2 and : G 2 —> Gi, we consider the following 
continuous matching, which is a correspondence induced by a pair of continuous maps : 

M c {(j>^,(thr-) = {(x,<?Mx)) | X £ Gi}(J{0^(y),y) | y £ G 2 }. (3) 

The distortion induced by </_> and </><_ is defined as: 

</><-) = sup \\d f (x,x') - d g fy,y')\. (4) 

(x,y),(x',y')eMc(4>^,</><-) ^ 

3 It is possible that S is achieved in the limit, in which case, we consider a sequence of e-correspondences whose corresponding 
metric distortion distance converges to <5 as e tends to 0. For simplicity, we assume that 5 can be achieved by the correspondence 

M*. 
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The functional-distortion distance between two metric graphs (Gi, df) and (G 2 , d g ) is defined as: 

d F D(Gi,G 2 )= inf max{P(^, 0<_), max \f(x) - g o f^.{x)\, max \f o 0<_(y) - g(y)\}, (5) 

iSGi y SG2 

where and <i>^ range over all continuous maps between Gi and G 2 . It is shown in [3] that 

Theorem 4 ([3]) ds(Dg 0 /, Dg 0 g) < d F o((Gi, df), (G 2 , d g )), where Dg 0 / and Dg 0 g denote the Otli- 
dimensional persistence diagram induced by the function f : Gi — > 1R and g : G 2 —> 1R, respectively. 

In what follows, we show that d F D((Gi, df), (G 2 , d g )) < 65 for / = d Gl , s and g = dc 2 .t- Note that in 
this case Dg 0 / = P s and Dg 0 g = Q t . Combining with Theorem 4, this then implies dg(P s , Qt) < 65. 


Remark. We note that Theorem 4 extends to the case where we consider the lst-extended persistence 
diagrams for / and g, respectively, in which case the constant in front of d F D will change from 1 to 3. In other 
words, if we include the lst-extended persistence diagrams in our definitions of the persistence-distortion 
distance, then Theorem 3 still holds with a slightly worst constant 18 (instead of 6). 


Lemma 5 d FD ((Gi,d / ), (G 2 ,d g )) < 6 5 for f = d Gl , s and g = dc 2 ,t- 

Proof: First, we introduce the function-restricted Gromov e-Hciusdorff distance d rG H ((Gi, d Gl ), (G 2 , dc 2 )), 
which is a more restricted version of the Gromov-Hausdorff distance, defined as follows: 


drGH((Gi,d Gl ), (G 2,d G2 )) 

= inf max {^ max \d Gl (x 1 ,x 2 ) - d G2 (yi,y 2 )\, max \f(x)-g(y)\}}, (6) 

M 2 {x 1 ,y 1 ),ix2,y2)&M (x,y)GM 

where M. ranges over all correspondences between graphs G\ and C' 2 . Compai'ed to the definition of the 
Gromov-Hausdorff distance in Eqn (2), the functional Gromov-Hausdorff distance has an extra condition that 
the functional difference | f(x) — g(y) \ between a pair of corresponding pair of points x G G'i and y 6 C 2 
should also be small. 

We claim that dpn((Gi, d/), (G 2 , d 9 )) < 3d r cH({Gi,dc 1 ),(G2,dc 2 ))- The proof follows almost 
exactly the same as the proof of Theorem A. 1 of [3], Specifically, in Theorem A. 1 of [3], it states that 
<Ifd((Gi ,df), (G 2 ,d g )) < 3df G H((Gi,df), (G 2 , d g )), where the so-called functional Gromove-Hausdorff 
distance df G n((Gi,df), (G 2 , d g )) defined as: 


d fGH ((G 1 ,d f ),(G 2 ,d g )) 

= inf max {J max \d f (x 1 ,x 2 ) - d g (y 1 , y 2 )\, max \f(x) - g(y)\} }, 
M 2 {x 2 ,yi),(x 2 ,y 2 )&M ( x,y)eM 


where M. ranges over all correspondences between graphs G\ and G' 2 . In other words, the difference between 
df G H and our d rG n is that the metric on G \ (resp. on G 2 ) is dj versus the input graph metric d Gl (resp. d g ver¬ 
sus d G f). Nevertheless, it turns out that the proof of d F o((Gi, df), (G 2 , d g )) < 3df G n{(Gi, df), (G 2 , d g )) 
can be easily modified to prove d F D((Gi, d/), (G 2 , d s )) < 3d rG H((Gi,d Gl ),(G2,d G2 )). Specifically, 
one property that will be used many times in this modification is that df(xi,xf) < d Gl (x\,xf) for any 
x\ . X 2 £ G\ (a symmetric statement holds for points in G 2 ). Since the proof is almost verbatim of the proof 
for Theorem A. 1 of [3], we omit it here. 

Given the optimal correspondence M* for the Gromov-Hausdorff, it is easy to see that \f(x) — g(y)\ = 
\d GuS (x) — dc 2 .t(y)\ < 25 for any pair (x, y) E Ad*. For the correspondence Ad*, the other two terms in 
Equation 6 arc both bounded by 25. It then follows that 


d r GH((Gi,d Gl ), (G 2 ,d G2 )) <25. 
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Combining this with cIfd((Gi, df), (G 2 , d g )) < 3o? r aff((Gi, dcf), (G 2 , da 2 )), the lemma then follows. ■ 
We remark that one can in fact further modify the proof of Theorem A. 1 of [3] to obtain a smaller constant 
in Lemma 5. 

Putting everything together we obtain Theorem 3. 


4 Discrete PD-Distance 

Suppose we are given two metric graphs (Gi = (V\ ■ Ed). da ,) and (G 2 = (V 2 , E-i). da 2 ), where the shortest 
distance metrics da 1 and d(j 2 are induced by lengths associated with the edges in E\ U E^. As a simple 
warm-up, we first compute the following discrete version of persistence-distortion distance where only graph 
nodes in V\ and V 2 are considered: 

Definition 6 LetC := { I\, | v G V(Gi)} and E := {Q„ | u G V'fC'L)} be two discrete sets of persistence 
diagrams. The discrete persistence-distortion distance between Gi and G 2 , denoted by dpo(Gi, G 2 ), is 
given by the Hausdorff distance djj(C, E). 

We note that while we only consider graph nodes as base points, the local maxima of the resulting geodesic 
function may still occur in the middle of an edge. Nevertheless, for a fixed base point, each edge could have 
at most one local maximum, and its location can be decided in 0(1) time once the shortest-path distance 
from the base point to the endpoints of this edge are known. The observation below follows from the fact that 
geodesic distance is 1-Lipschitz (as the basedpoint moves) and the stability of persistence diagrams. 

Observation 7 dpo(Gi, G 2 ) < c!pd(Gi, G 2 ) < dpD(Gi, G 2 ) + f, where t is the largest length of any 
edge in E\ U E- 2 - 

Lemma 8 Given metric graphs Gi = ( V \, Ef) and G 2 = (V 2 , Ef, dpD(Gi,G 2 ) can be computed in 
0(n?m 1-5 logm) time, where n = max{|Vi|, | V 2 1} and m = max{|ifi |, lE^I}- 

Proof: For a given base point s G V\ (or t G V 2 ), computing the shortest path distance from s to all 
other graph nodes, as well as the persistence diagram P s (or Qt) takes Oirn log n) time. Hence it takes 
0(mn log n) total time to compute the two collections of persistence diagrams C = {P s | s G V (Gi)} and 
E = {Qt | t G F(G 2 )}. 

Each persistence diagram P s has O (m) number of points in the plane - it is easy to show that there are 
0{m) number of local maxima of the geodesic function df; | S (some of which may occur in the interior of 
graph edges). Since the birth time b of every persistence point (b, d) corresponds to a unique local maximum 
rib with f(u\:) = b, there can be only 0(m ) points (some of which may overlap each other) in the persistence 
diagram P s . 

Next, given two persistence diagrams P s and Q t , we need to compute the bottleneck distance between 
them. In [13], Efrat et al. gives an 0(k 1-5 log k) time algorithm to compute the optimal bijection between 
two input sets of k points P and Q in the plane such that the maximum distance between any mapped pair of 
points (p,q) <G I‘ x Q is minimized. This distance is also called the bottleneck distance, and let us denote it by 
dp,. The bottleneck distance between two persistence diagrams P s and Q t is similar to the bottleneck distance 
dp,, with the extra addition of diagonals. However, let P' and Q' denote the vertical projection of points 
in P s and Q t , respectively, onto the diagonal L. It is easy to show that ds( P,Q) = ds{ P s U (f. Qt U P'). 
Hence ds( P s > Qt) can be computed by the algorithm of [13] in 0(m 15 logm) time. Finally, to compute 
the Hausdorff distance between the two sets of persistence diagrams C and E, one can check for all pairs of 
persistence diagrams from these two sets, which takes 0(n 2 m 1,5 logm) time since the \C\ < n and \E\ < n. 
The lemma then follows. ■ 


By Observation 7, dpo(Gi, G 2 ) only provides an approximation of dpo(Gi, G 2 ) with an additive error 
as decided by the longest edge in the input graphs. For unweighted graphs (where all edges have length 
1), this gives an additive error of 1. This in turns provides a factor-2 approximation of the continuous 
persistence-distortion distance, since dpD(Gi, G 2 ) is necessarily an integer in this setting. 

Corollary 9 The discrete persistence-distortion distance provides a factor-2 approximation of the continuous 
persistence-distortion distance for two graphs Gi and G 2 with unit edge length; that is, dpofG ], G 2 ) < 
dpo(Gi, G 2 ) < 2dpo(Gi, G 2 ). 

One may add additional (steiner) nodes to edges of input graphs to reduce the longest edge length, so that 
the discrete persistence-distortion distance approximates the continuous one within a smaller additive error. 
But it is not clear how to bound the number of steiner nodes necessary for approximating the continuous 
distance within a multiplicative error, even for the case when all edges weights are approximately 1. Below 
we show how to directly compute the continuous persistence-distortion distance exactly in polynomial time. 

5 Computation of Continuous Persistence-distortion Distance 

We now present a polynomial-time algorithm to compute the (continuous) persistence-distortion distance be¬ 
tween two metric graphs (Gi = (Vi, Ei), dcf) and (G 2 = (V 2 , Ef), dc 2 )- As before, set n = rnaxj| Vj |. | V 2 1} 
and m = max {j 77] |, \Ef\ }. Below we first analyze how points in the persistence diagram change as we move 
the basepoint in Gi and G 2 continuously. 

5.1 Changes of persistence diagrams 

We first consider the scenario where the basepoint s moves within a fixed edge a 6 E\ of Gi, and analyze 
how the corresponding persistence diagram P s changes. Using notations from Section 2, let (ub, Ud ) be the 
critical-pair in Gi that gives rise to the persistence point (b, d) E P s - Then Mb is a maximum for the distance 
function dG 1:S , while Ud is an up-fork saddle for dc 1 , s - We call Mb and Ud from Gi the birth point and death 
point w.r.t. the persistence-point (b, d) in the persistence diagram. 

As the basepoint s moves to s' E a within e distance along the edge a for any e > 0, the distance function 
is perturbed by at most e; that is, ||d< 3 1)S — dc t , s ' Hoc < e. By the Stability Theorem of the persistence 
diagrams [6], we have that dj> { P s , Ps') < £■ Hence as the basepoint s moves continuously along a, points in 
the persistence diagram P s move continuously 4 . We now analyze how a specific point (b, d) may change its 
trajectory as s moves from one endpoint v\ of o = (ui, of E E\ to the other endpoint V 2 ■ 

Specifically, we use the arc-length parameterization of a for s, that is, s : [0, Len(a)] — y a. For any object 
X E {b, d, Mb, rid}, we use X(s) to denote the object X w.r.t. basepoint s(s). For example, (b(s),d(s)) 
is the persistence-point w.r.t. basepoint s(s), while Ub(s) and Md(s) are the corresponding pair of local 
maximum and up-fork saddle that give rise to (b(s), d(s)). We specifically refer to b : [0, Len(cr)] —> IR and 
d : [0, Len(cr)] —> 1R as the birth-time function and the death-time function, respectively. By the discussion 
from the previous paragraph, these two functions are continuous. 

Critical events. To describe the birth-time and death-time functions, we need to understand how the 
corresponding birth-point and death-point Ub(s) and Ud (V) in Gi change as the basepoint s varies. Recall 
that as s moves, the birth-time and death-time change continuously. However, the critical points Ub(s) and 

4 There could be new persistence points appearing or current points disappearing in the persistence diagram as s moves. Both 
creation and deletion necessarily happen on the diagonal of the persistence diagram as ds(P s , P s ') necessarily tends to 0 as s' 
approaches s. Nevertheless, for simplicity of presentation, for the time being, we describe the movement of persistence points 
ignoring their creation and deletion. Such creation and deletion will be addressed later in Section 5.1.3. 
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(b) 


Figure 3: For better illustration of ideas, we use height function defined on a line to show: (a) a max-max 
critical event at sq; and (b) a saddle-saddle critical event at sq. 


Ud(s) in Gi may (i) stay the same or move continuously, or (ii) have discontinuous jumps. Informally, if 
it is case (i), then we show below that we can describe b(s) and d(s) using a piecewise linear function 
with 0(1) complexity. Case (ii) happens when there is a critical event where two critical-pairs (ub, uf) 
and (u' b , u' d ) swap their pairing partners to (//;,. u' d ) and (u b , Ud). Specifically, at a critical event, since the 
birth-time and death-time functions are still continuous, it is necessary that either dc 1 , s {ub ) = ( k: t , s ('«),) or 
dc 1 , s (ud) = df;, a (u d ) m , we call the former a max-max critical event and the latter a saddle-saddle critical 
event. See Figure 3 for an illustration. It turns out that the birth-time function b : [0, Len(er)] —y M (resp. 
death-time function d) is a piecewise linear function whose complexity depends on the number of critical 
events, which we analyze below. 


5.1.1 The death-time function d : [0, Len(cr)] —> M 

The analysis of death-time function is simpler than that of the birth-time function; so we describe it first. 
Note, given that dG 1}S is the geodesic distance to the base point s, a merging of two components at an up-fork 
saddle cannot happen in the interior of an edge, unless at the basepoint s itself. 


Observation 10 An upper-fork saddle ugGi is necessarily a graph node from Vj with degree at least 3 
unless u = s. 


To simplify the exposition, we omit the case of u = s (which is an easier case) in our discussions below. 
Since the up-fork saddles now can only be graph nodes, as the basepoint s(s) moves, the death-point Ud(s) 
either (case-1) stays at the same graph node, or (case-2) switches to a different up-fork saddle u' A (i.e, a 
saddle-saddle critical event); see Figure 4. 

Now for any point i G Gj, we introduce 
the function g x : [0, Len(<r)] —y M which 
is the distance function from x to the mov¬ 
ing basepoint s(s) for s E [0, L a ]\ that is, 
g x (s) := dc 1 )S {s)i x )- Intuitively, as the base- 
point s(s) moves along a, the distance from 
s(s) to a fixed point x either increases or de¬ 
creases at unit speed, until it reaches a point 
where the shortest path from s(s) to x changes 



Len(a) 


(c) 


Figure 4: (c) Graph of function g x : [0, Len(cr)] —> 1R. 
discontinuously. We have the following observation. 


Claim 11 For any point x E Gi, as the basepoint s moves in an edge cr E E, the distance function 
g x : [0, Len(fi)] —> M defined as g x (.s) := dc i, s ( s )(x) is a piecewise linear function with at most 2 pieces, 
where each piece has slope either ‘1 ’ or ‘-1 See Figure 4 (c). 

Proof: Let v\ and v -2 be the two endpoints of the edge a where the basepoint s lies in. For a fixed point 
x E G\, first consider the shortest path tree T x with x being the source point (root). If the edge a is a 


10 














tree edge in the shortest path tree T x , then as s moves from v\ to v 2 , the shortest path from x to s changes 
continuously and the distance d Gl (x, s) increases or decreases at unit speed. In this case, the function g x 
contains only one linear piece with slope either ‘1’ (if s is moving towards x) or ‘-1’ (if s is moving away 
from x). 

Otherwise, the shortest distance to s(s) from x will be the shorter of the shortest distance to v, plus the 
distance from Vi to s(s), for i = l,or 2 and s E [0, Len(cr)]. That is, 

g x (s) = min{d Gl (x, vf) + s, d Gl (x, v 2 ) + Len(cr) - s}. 

The two functions in the above equation are linear with slope ‘1’ and ‘-1’, respectively. The graph of g x is 
the lower envelop of the graphs of these two linear functions, and the claim thus follows. 

We note that the break point of the function g x , where it changes to a different linear function, happens at 
the value so such that d Gl (x, v±) + so = d Gl ( x, v 2 ) + Len(cr) — so, and it is easy to check that s(so) is a 
local maximum of the distance function g x . m 

As s(s) moves, if the death-point (s) stays at the same up-fork saddle u, then by the above claim, the 
death-time function d (which locally equals g u ) is a piecewise linear function with at most 2 pieces. 

Now we consider (case-2) when a saddle-saddle critical event happens: Assume that as s passes value so> 
rtd(s) switches from a graph node u to another one v!. At the time s o when this swapping happens, we have 
that d ( ; lXi ( S0) (u) = d Cl ^( soj (ii'). In other words, the graph for function g„ and the graph for function g v > 
intersect at so- Before so, d follows the graph for the distance function g u , while after time sq, «d changes its 
identity to u' and thus the movement of d will then follow the distance function g u > for s > sq. Since the 
function g x is piecewise-linear (PL) with at most 2 pieces as shown in Figure 4 (c) for any point x E Gi, 
the switching for a fixed pair of nodes u and u' can happen at most once (as the graph of g u and that of g u i 
intersect at most once). Overall, since there are | V\ \ < n graph nodes, we conclude that: 

Lemma 12 Av s moves along a, there are ()( rr ) number of saddle-saddle critical events in the persistence 
diagram P s . 

For our later arguments, we need a stronger version of the above result. Specifically, imagine that we 
track the trajectory of the death-time d for a persistence pair (b. d). 

Proposition 13 For a fixed persistent point ( b(0),d(0)) E P s (o> the corresponding death-time function 
d : [0, Len(fj)] — > M is piecewise linear with at most 0(n) pieces, and each linear piece has slope either ‘1 ’ 
or ‘-1 This also implies that the function d is 1-Lipschitz. 

Proof: By Observation 10, Ud(s) is always a graph node from V\. For any node u, recall g u (s) = d Gl S ^(u). 
As described above, d(s) will follow certain g u with u = rtd(s) till the identify of Ud(s) changes at a 
saddle-saddle critical event between u with another up-fork saddle u'. Afterwards, d(s) will follow g u > till 
the next critical event. Since each piece of g v has slope either ‘1’ or ‘-F, the graph of d consists of linear 
pieces of slope ‘1’ or ‘-F. Note that this implies that the function d is a 1-Lipschitz function. 

On the other hand, for a specific graph node u E V, each linear piece in g u has slope ‘1’ or ‘-1’. This 
means that one linear piece in g u can intersect the graph of d at most once for s E [0, Len(cr)] as d is 
1-Lipschitz. Flence the graph of g u can intersect the graph of d at most twice; implying that the node u 
can appear as Ud{s) for at most two intervals of s values. Thus the total descriptive complexity of d is 
0(|Vi|) = 0(n), which completes the proof. ■ 

5.1.2 The Birth-time Function b : [0, Len(er)] —> M 

To track the trajectory of the birth-time b of a persistence pair (b(0), d(0)) E Po, we study the movements of 
its corresponding birth-point (which is a maximum) u b : [0, Len(cr)] ^ G in the graph. Flowever, unlike 
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up-fork saddles (which must be graph nodes), maxima of the distance function dci, s can a l so appear in the 
interior of a graph edge. Roughly speaking, in addition to degree-1 graph nodes, which must be local maxima, 
imagine the shortest path tree with s as the root (source), then for any non-tree edge, it will generate a local 
maximum of the distance function do llS . (Recall the maximum u in Figure 1 (b), which lies in the interior 
of edge (^ 3 ,^ 4 ). ) Nevertheless, the following result states that there can be at most one local maximum 
associated with each edge. 

Lemma 14 Given an arbitrary basepoint s, a maximum for the distance function d(; , .s : Gi —> M is either a 
degree-1 graph node, or a point v with at least t\\>o shortest paths to the basepoint s which are disjoint in a 
small neighborhood around v. 

Furthermore, there can be at most one maximum ofdc 1 , s in each edge in E\. 

Proof: Consider the shortest path tree T of Gi rooted at s. All degree-1 graph nodes in V will be tree leaves, 
and each of them is thus a local maximum for the distance function do ljS - For each such maximum, we 
associate it with the unique tree edge incident on it. 

Now take a maximum v which is not a degree-1 graph node. Set k to be the number of branches 
incident on v in a sufficiently small neighborhood of w. k = 2 if /; is in the interior of an edge of E\, and 
k = degree(v ) > 2 if v is a graph node. Since fie, ,sis larger than the distance from basepoint s to any 
other point in the neighborhood of v, and since the distance function is continuous, there must exist at least k 
different shortest paths from s to v, each one coming from a different branch around v (and thus disjoint in a 
small neighborhood around v). 

Furthermore, for each of the E\ — V\ + 1 number of edges not in the shortest path tree T rooted at s, say 
e = (wi,W2), it must contain one local maximum for the distance function d Gl ,bp■ Indeed, by property of 
shortest path distance, we know that d Gl!S (wi) < d GljS (w2) + Len(e) and d GljS (w2) < d Gl}S (wi) + Len(e). 
If the equality does not hold in either of these two relations, then as we move x from the endpoint with lower 
distance value, say w\, to W2 along e, the shortest distance must first increase and then decrease, meaning 
that there is a local maximum in the interior of e. Specifically, the local maximum happens at the point 
v E e such that d Glt s(wi) + |ir’i — n|| = d Gl}S (w2) + ||v — W2W, and there are two shortest paths from s 
to v, one passing through w\ and the other passing through W2 . If the equality holds for one of them, say 
do i,s(tt> 2 ) = dG 1 , s (wi) + Len(e), then W 2 may or may not be a local maximum. 

Overall, each edge in G, whether it is a tree edge or non-tree edge in T, will produce at most one local 
maximum for the distance function d GljS . The claim the follows. ■ 

As the basepoint s moves, the position of the local maximum within an edge may stay or may move 
continuously along the edge e. The above claim states that for a fixed basepoint, there can be at most one 
maximum in an edge e E E\. Hence instead of tracking (which could move continuously), we now 
associate the identity of «b with the birth-edge e b E E\ that contains Ub, and tracks the changes of the 
birth-edge eb : [0, Len(cr)] —> E\ as the basepoint moves: In particular, as s E [0, Len(cr)] changes, eb(s) 
could remain the same edge, or it could change to a different one. We now investigate each of these two 
cases. 


Case 1: eb does not change. For a fixed edge e € E 1 we introduce the function 
g e : [0, Len(cr)] — y M where, for any s E [0, Len(cr)], g e (s) is the distance from the 
basepoint s(s) to the unique maximum (if it exists) in e; g e (s) = +00 if the distance 
function d GlS ^ does not have a local maximum in e. We refer to the portion of g e 
with finite value as well-defined. Intuitively, the function g e serves as the same role 
as the distance function g x in Section 5.1.1, and similar to Claim 11, we have the 
following characterization for this distance function. 


.ge 





0 


0 Len{a 


Figure 5: Graph 

of function g e : 

[0, Len(cr)] —> 1R. 
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Proposition 15 For any edge e E E±, the well-defined portion of the function g e is 
a piecewise-linear function with 0 ( 1 ) pieces, where each piece is of slope ‘1 ‘-1 ’ 

or ‘O’. See Figure 5 for an illustration. 

Proof: We assume that e a; the case e = cr is simpler to handle. Let m(s) E e = (mi, m 2 ) be the maximum 
for distance function dc 1)S (s) w.r.t. basepoint s(s) E a = (ui, u 2 ). Note that g e (s) = d Gl S ( s )(m(s)). From 
the proof of Lemma 14, we know that 


^Gi,s(s )( m ( s )) = ^Gi,s( s )(^i) + \\wi ~ m(a)|| = d GlMs) (w 2 ) + ||m 2 - m(s)|| 


(V) 

( 8 ) 


giui(s) + ||mi - m(s)|| 


g W2 (s) + \\w 2 - m(s)||. 


(Recall that as introduced in Section 5.1.1, g x : [0,Len(cr)] —y 1R is defined as g x (s) = d Gl ^ s ' ) (x). ) 
Conversely, a point m(s) in the interior of e satisfying the equation above must be a local maximum of the 
distance function d GljS t s y By Claim 11, as s varies, g wi (resp. g W2 ) is a piecewise linear function with at 
most two pieces of slope ‘ 1 ’ or ‘-L. 

(1) If at s E [0, Len(cr)], the slopes of functions g,,,, and g W2 are the same (i.e, as s increases, c£gi,s(s) ( u ’i) 
and ^G 1 ,s(s)(m 2 ) both increase or both decrease at the same speed), then by Eqn ( 8 ), the local maximum 
m(s) remains the same as s moves. Hence g e (s) = d Gl iS ( s )(m(s)) follows a linear function with the same 
slope as g Wl (and g,,, 2 ) which is either ‘ 1 ’ or ‘- 1 ’. 

(2) If at s, the slopes of g wi and g W2 are not the same, (i.e, as s increases, d Gl s ( s )(mi) and dGi,s(s)( u ' 2 ) 
change in the opposite manner), then in order for Eqn ( 8 ) to hold, m(s) moves at the same speed as s(s). In 
this case, g e (s) remains the same value, that is, g e is a linear (in fact, constant) function with slope ‘O’. 

Now decompose [0, Len(cr)] into maximal intervals such that within each interval, g Wj and g „, 2 each can 
be described by a single linear function. By Claim 11 , there can be at most three such intervals. Within each 
interval, if a maximum exists in edge e, then the function g e (which is the distance to this maximum) can be 
described by a linear function of slope ‘1’, ‘-1’ or ‘O’, as described by the two cases above. 

Finally, note that in (2) above, as m(s) moves along e, it is possible that m(s) coincides with one of its 
endpoint say w\. After that, Eqn ( 8 ) cannot hold and the local maximum moves out of edge e - Indeed, one 
can verify that after that, the edge e becomes a tree edge in the shortest path tree rooted at s(s). In other 
words, afterwards, g e is no longer well-defined. Within a single maximal interval of [0, Len(cr)] as described 
above, such event can happen at most once for each of w\ and W 2 - Overall, the well-defined portion of g e 
consists 0(1) linear functions of slope ‘1’, ‘-1’ or ‘O’. ■ 

We remark that we can actually obtain a stronger characterization for the function g e , which states that 
the well-defined portion has to be connected, and consists of at most three pieces with a graph as shown in 
Figure 5 (any piece can be degenerate). However, the above proposition suffices for our later arguments. 

Case 2: e\> changes from edge e to <■. The change of the identity of eb could be due to that the local 
maximum Ub moves continuously from e to a neighboring edge e! that shares an endpoint with e. Alternatively, 
it could be caused by a max-max type critical event: Specifically, let u<± be the up-fork saddle currently paired 
with the current birth point Ub = u E eb generating the birth time b of (b, d) in the persistence diagram. 
At a max-max critical event, the up-fork saddle changes its pairing partner from Ub = u E eb to another 
maximum v! in edge e'. Afterwards, the identify of eb corresponding to the birth-time b will change to e!. At 
the time so when the swapping happens, d Gl ^ s ^{u) = d Gl ^ S0 ' ) (u'). It then follows that g e (so) = g e '( s o); 
that is, s 0 corresponds to an intersection point between the graph of the function g e and that of the function 
g e /. Since the function g e consists of 0(1) linear pieces for any e, there are 0(1) intersection points between 
a pair of e and e' from E\. We thus have: 

Lemma 16 There are 0{m 2 ) max-max critical events as the basepoint s moves along a fixed edge a E E. 
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As in the case of tracking the death-time function d, our later analysis requires a stronger result bounding 
the descriptive complexity of the birth-time function b : [0, Len(cr)] —> M, starting from a birth-time b(0) 
from a fixed persistence pair (b(0), d(0)) £ P s (p)- I n particular, we have the following proposition: 

Proposition 17 For a fixed (b(0), d(0)) £ P s (o)> the birth-time function b : [0,Len(er)] H> M, tracking 
birth-time b(0), is piecewise linear with at most 0(m ) pieces, and each linear piece has slope either ‘1 ‘-1 

or ‘O’. Note that this also implies that the function b is 1-Lipschitz. 

Proof: We track the edge et>(s) containing the maximum Ub(s) that gives rise to the birth-time b(s) for 
s £ [0, Len(cr)]. As described above, b(s) will follow g e for e = eb(s) till eb changes its identity to a new 
edge e'. Afterwards, b(s) will follow g e t till next time eb changes identity. By Proposition 15, b thus consists 
of a set linear linear functions, each of slope ‘P, ‘-1’, or ‘O’. Note that this also implies that b is a 1-Lipschitz 
function. 

We now bound the descriptive complexity of b. Note that any break-point between two consecutive linear 
pieces in b of different slopes necessarily involve at least one linear piece of slope either ‘P or ‘-P. So we 
can charge the number of break-points to the number of non-constant linear pieces in b. On the other hand, 
consider any non-constant piece from the function g e : This piece can appear in the graph of the function b at 
most once, because b is 1-Lipschitz, and g e has slope either ‘P or ‘-P. Since there are 0{m) edges in E \, 
there are 0{m) non-constant linear pieces from all functions g e , with e £ E±, which implies that there are 
only 0(m) number of breakpoints in b. This proves the le mm a. ■ 

Remark. The readers may have the following question: Recall that the function g e could contain portions 
which are not well-defined. Suppose at some point, e^ = e and b is following the graph of g e . What if we 
reach the endpoint so of the well-defined portion of g e ? We note that when this happens, as detailed in the 
proof of Proposition 15, the corresponding maximum u b currently is an endpoint say uq of e, and as the 
basepoint continues to change, either, ttb(s) moves to a neighboring edge e' of e incident on w \; or, w\ was a 
up-saddle prior to so and at time so, the max Ub = w\ cancel with rid = w\ (which we describe in Section 
5.1.3 below). Overall, as the Stability Theorem guarantees, b is necessarily a continuous function. 

5.1.3 Tracking the persistence pair (b, d) : [0, Len(er)] -> M 2 . 

Now consider the space H a := [0, Len(cr)] x M 2 , where M 2 denotes the birth-death 
plane: We can think of U a as the stacking of all the planes containing persistence dia¬ 
grams P s ( s ) for all s £ [0, Len(cr)]. Hence we refer to 1 [„ as the stacked persistence- 
space. For a fixed persistence pair (b,d) £ P s ( s ), as we vary s £ [0,Len(cr)], it 
traces out a trajectory i r = {(s, b(s),d(s)) | s £ [0, [Len(cr)]} £ n^, which is the 
same as the “vines” introduced by Cohen-Steiner et al. [8]. By Propositions 13 and 
17, the trajectory 7r is a polygonal curve with 0(n + m) = 0(m) linear pieces. See 
the right figure for an illustration, where there are three trajectories in the stacked 
persistence diagrams. 

In general, a trajectory (a vine) could appear or terminate within the range (0, Len(cr)). Specifically, as 
we track a specific point in the persistence diagram, it is possible that the pair of critical points giving rise to 
this persistent-point may coincide and cease to exist afterwards. In this case, the corresponding trajectory 
(vine) hits the diagonal of the persistence diagram (since as the two critical points coincide with Ub = u^, 
we have that b = d) and terminates. The inverse of this procedure indicates the creation of a new trajectory. 
Nevertheless, we can show that there can be 0(n + m) = 0{m) total number of trajectories in the stacked 
persistence diagrams (whether they span the entire range of s £ [0, Len(cr)] or not). We conclude with the 
following result. 
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Theorem 18 Let a G E\ be an arbitrary edge from the metric graph (G i, d Gl )■ As the basepoint s 
moves from one endpoint to another endpoint of a by s : [0, Len(cr)] — > a, the persistence-points in the 
persistence diagram P s ( s ) of the distance function dQ lS / s \ form 0{m) number of trajectories in the stacked 
persistence-space Ii a . Each trajectory is a polygonal curve ofO{m ) number of linear segments. 

A symmetric statement holds for metric graph (G 2 , dc 2 ). 

Proof of Theorem 18. As the basepoint s(s) moves along an edge a with s G [0, Len(<r)], we can think 
of the distance function d GlS ^ as a time-varying function with time range [0,Len(cr)]. For a general 
time-varying function, as we track a specific point in the persistence diagram [8], it is possible that the pair 
of critical points giving rise to this persistent-point may coincide and cease to exist afterwards. As already 
mentioned above, in this case, the trajectory (the vine) hits the diagonal of the persistence diagram (since as 
the two critical points coincide Ub = rid, we have that b = d) and terminates. The inverse of this procedure 
indicates the creation of a new trajectory. Hence a trajectory in the stacked persistence-diagrams may not 
span the entire range [0, Len(<r)]. 

We claim that there can be only 0(m ) number of trajectories in the stacked persistence diagram. In 
particular, first, note that at time s = 0, there can be 0(n + m) = 0(m ) number of persistence-points in 
the persistence diagram P s (o) for basepoint s(0). This is because that for a fixed basepoint, by Lemma 14, 
there can be only 0(n + m) number of local maxima for the distance function d GlS ( 0 ) : Gi —> DC thus 
generating 0(m) number of persistence-points in the persistence diagram. As a result, there can be at most 
0[m) trajectories spanning through the entire range [0, Len(cr)]. 

We next bound the number of trajectories not spanning the entire range. That is, these are the trajectories 
created or terminated at some time in (0, Len(cr)). For any such trajectory, assume without loss of generality 
that it tracks a persistence-point (b, d), and terminates at time so- (The case of creation of a new trajectory is 
symmetric.) At this point, it is necessary that the two critical points u\> (a local maximum) and Ud (a up-fork 
saddle) coincide. By Observation 10, the death-point Ud must be a graph node, say wo G V \. Hence Ub = wq 
as well; that is, wq is also a maximum of the distance function d Gl , S ( S0 )- We show that for a fixed graph node 
wo, such a scenario can happend at most once. 

Lemma 19 For a fixed graph node u’q G Vi, the birth-point Ub and death-point Ud can coincide at wq at 
most once as s varies in the range [0, Len(cr)]. 

Proof: As above, assume the trajectory hits the diagonal of the persistence diagram at time so, and wq is 
the corresponding coincided birth- and death-points. Suppose at s~ < so infinitesimally close to sq, the 
corresponding local maximum x~ = Ub(s~) comes from edge e incident on wq. Assume without loss of 
generality that s~ is sufficiently close to so such that there is no critical event of any kind and the local 
maximum Ub(s) approach continuously to wq as s tends to so (i.e, for s G (s~, sq )). 

Let w\ be the other endpoint of e . Since x~ is a maximum of d Gl S ( s -y by Lemma 
14, there are two shortest paths from s(s _ ) to x~ passing through w 0 and w\, which 
we denote by 7To and tt\, respectively. We show that 7To and 7Ti in fact are disjoint other 
than at their endpoints x~ and s(s _ ). See the right figure for an illustration. 

Indeed, consider the shortest path tree T~ rooted at s(s _ ), and let 2 be the common 
ancestor of wo and w \; 2 is necessarily a graph node of Vj unless 2 = s(s“). If 2 is 
a graph node, then as s varies, the distance to 2 either increases or decreases. However, the shortest path 
distance from 2 to wo and to w\ remain the same. Hence the distance functions g„. () = dc ljS (s)( w o) and 
g wi = d G] . s (. s )('U'i) either both increase or both decrease (because the shortest distance to them is the shortest 
distance to 2 plus the shortest distance from 2 to each of them). However, this falls into case (1) in the proof 
of Proposition 15, which means that the local maximum necessarily remains the same at x~ as s moves from 
s~ to sq, and will not move to wq- Contradiction. As such, 2 must be s(s“). In other words, the two shortest 
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paths 7Ti and 7r 2 meet only at s(s“) and x : Their concatenation form a simple loop C where x~ and s(s“) 
are a pair of antipodal points along this loop (i.e, they bisect C). As s moves to sq, its corresponding local 
maximum up(s) remains the antipodal point of s(s) and moves towards wq, and wo is the antipodal point of 

s(s 0 ). 

In other words, let v±,V 2 denote the two endpoints of the edge a where the basepoint s lies in. Since up 
is the antipodal point of s(so), we have that dc 1 (vi,wo) + so = dc x ( V 2 , up) + Len(cr) — so- Hence there is 
only one possible value for so- This proves the lemma. ■ 

It then follows that there can be at most 0(n) number of trajectories not spanning the entire time range 
[0, Len(a)] (created or terminated in the stacked persistence diagrams). Putting everything together, we have 
that there are at most 0(n + m) = 0(m) trajectories in the stacked persistence diagrams as the basepoint s 
moves in an edge a E E\. Combining this with Propositions 13 and 17, Theorem 18 then follows. 

5.2 Computing dpo(Gi, G 2 ) 

Given a pair of edges a s E Gi and a t E G 2 , as before, we parameterize the basepoints s and t by the 
arc-length parameterization of a s and ap, that is: s : [0, L s ] —> a s and t : [0, L t \ at where L s = Len(<r s ) 
and Lt = Len (at). We now introduce the following function to help compute dpo(Gi, G 2 ): 

Definition 20 The bottleneck distance function F tT3t<Tt : G —> HI is defined as F as /ri (s. t) i->- d/j(P s '.s)- Qs(/) )- 
For simplicity, we sometimes omit a s , at from the subscript when their choices are clear from the context. 

Recall that C = {P s | s E Gi}, T = {Qt | t E G 2 }, and by Definition 2: 

dpn(Gi,G 2 ) = maxjmaxmindn(P, Q), maxmindRiP, Q) |. 

Pec QeJ* 7 PeJ 7 Pec J 

Below we focus on computing dn{C, J 7 ) := maxp e c minQgjr de(P, Q), and the treatment of djt(T. C) := 
rnaxpgj- minp e c ds{ P, Q) is symmetric. It is easy to see: 

dfj(C, J~) = max min ((r(P, Q) = max max min min F„ a .(s,t). (9) 

Pec QeJ* 7 se[i,L s ] ^ec 2 te[t,G] 

In what follows, we present the descriptive complexity of F asj(Tt for a fixed pair of edges a s E Gi and 
a t E G 2 in Section 5.2.1, and show how to use it to compute the persistence-distortion in Section 5.2.2. 

5.2.1 One pair of edges a s E Gi and a t E G 2 . 

Recall that we call the plane containing the persistence diagrams as the birth-death plane, and for persistence- 
points in this plane, we follow the literature and measure their distance under the Loo-norm (recall Definition 
1 ). From now on, we refer to persistence-points in P s ( s ) as red points, while persistence-points in Qtu) as 
blue points. As s and t vary, the red and blue points move in the birth-death plane. By Theorem 18, the 
movement of each red (or blue) point traces out a polygonal curve with 0(m) segments (which are the 
projections of the trajectories from the stacked persistence diagrams onto the birth-death plane). 

Set O := [0, L s ] x [0, L t ] and we refer to it as the s-t domain. For a point (s, t) E G, the function value 
F(s, t)(= F as)(Jt (s, t)) = ds{ P s ( s ), Q t m) i s the bottleneck distance between the set of red and the set of 
blue points (with the addition of diagonals) in the birth-death plane. To simplify the exposition, in what 
follows we ignore the diagonals from the two persistence diagrams and only consider the bottleneck matching 
between red and blue points. 

Let r*(s) E P s ( s ) and b*(t) E Qtp) be the pair of red-blue points from the bottleneck matching between 
P s ( s ) and Qt(t) su °h that doo(r*(s), b*(t )) = de( P s ( s )j We call (r* (s), b* (t)) the bottleneck pair (of 

red-blue points) w.r.t. (s, t). As s and t vary continuously, red and blue points move continuously in the 
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birth-death plane. The distance between any pair of red-blue points change continuously. The bottleneck pair 
between P s ( s ) and Qt(t) typically remains the same till certain critical values of the parameters (s. t). 

Characterizing critical (s,t) values. Given (s,t), consider the 
optimal bottleneck matching C* (s, t) : P s x Q t . For any correspond¬ 
ing pair (r(s), 6 (f)) E C*(s,t), doo(r(s),b(t)) < doo(r*(s),b*(t)). 

Suppose r*(s ) = n(s) and b*(t) = 61 (f). As (s, t ) varies in Q, the 
bottleneck pair (r*(s), b*(t)) may change only when: 

• (case-1): (ri(s), 61 (f)) ceases to be a matched pair in the 
optimal matching C*(s, t); or 

• (case-2): (r\(s), 61 (f)) is still in C*, but another matched pair 
(r 2 (s), 62 (f)) becomes the bottleneck pair. 

At the time (sq, fo) that either cases above happens, it is necessary that there are two red-blue pairs, 
one of which being (n., 61 ), and denoting the other one by ( 7 - 2 , 62 ), such that doo(n(so), 61 (fo)) = 
doo{r 2 (so), 62 ( 60 ))- (For case-1, we have that either 7-2 = r\ or 62 = 61 .) Hence all critical (s,f) val¬ 
ues are included in those (s, f) values for which two red-blue pairs of persistence-points acquire equal 
distance in the birth-death plane. Let 

X (r 1 ,b 1 ),{r 2 ,b 2 ) : = {(M) I doo{n(s), 61 (f)) = doo(?’ 2 (s), 62 (f))} 

denote the set of potential critical (s,t)-values generated by (ri, 61 ) and ( 7 - 2 , 62 ). To describe -X'(r 1 ,& 1 ),(r 2 , 6 2 )> 
we first consider, for a fixed pair of red-blue points (r, 6 ), the distance function D r ^ : [0, L s \ x [0, Lt\ —> IR 
defined as the distance between this pair of red and blue points in the birth-death plane, that is, D r ^(s , f) := 
(too(r(s), 6 (f)) for any (s,t) E Pi. 

In particular, recall that by Theorem 18, r : [0, L s \ —> M 2 (resp. 6 : [0, L t \ —> M 2 ) is continuous and 
piecewise-linear with ()(rn) segments. In other words, the range [0, L s \ (resp. [0, L t \) can be decomposed to 
0(m) intervals such that within each interval, r moves (resp. 6 moves) along a line in the birth-death plane 
with fixed speed. Hence combining Propositions 13 and 17, we have the following: 

Proposition 21 The s-t domain Q can be decomposed into an 0(m) x 0(m) grid such that, within each 
of the 0(m 2 ) grid cell, D r j, is piecewise-linear with 0 ( 1 ) linear pieces, and the partial derivative of each 
piece w.r.t. s or w.r.t. t is either ‘1 ’, ‘-1 ’, or ‘O’. 

Proof: Let X s (resp. It) denote the decomposition of [0, L s \ (resp. [0, Lt\) into 0(m) intervals within each of 
which the red (persistence) point r G P s (resp. the blue persistence point 6 E Qt) moves along a line in the 
birth-death plane. In fact, by Propositions 13 and 17, we also have that the birth-coordinate r.x for the red 
point r either increases or decreases at the unit speed (w.r.t. the parameter s), and the death-coordinate r.y of 
r either increases or decreases at the unit speed, or is stationary. Similar statements hold for the blue point 
f. Since D r ^(s, t) = doo(r(s), 6(f)) = max{|r.x(s) — 6.x(f)|, | r.y(s) — b.y(t)\}, it follows that for a fixed 
interval I\ E I s and I 2 E It, O r .h ■ h X I 2 —> M is piecewise-linear function with 0(1) linear pieces, where 
the partial derivative of each piece w.r.t. s or to f is either ‘1’, ‘-1’, or ‘O’. ■ 

Given two pairs of red-blue pairs (n, 61 ) and ( 7 - 2 , 62 ), the set A7 r| ),( r 2 , 6 2 ) °f potential critical (s,t) 
values generated by them corresponds to the intersection of the graph of D ri and that of /) r2 j, 2 . By 
overlaying the two 0(m) x 0(m) grids corresponding to D ri j, t and j, 2 as specified by Proposition 21, 
we obtain another grid of size 0(m) x 0(m) and within each cell, the intersection of the graphs of I) r] 
and D r2 b 2 has 0(1) complexity. Hence, we have: 

Corollary 22 The set -X'(r 1 ,& 1 ),(r 2 ,& 2 ) — consists of a set of polygonal cur\’es in the s-t domain bl with 
0(m 2 ) total complexity. 
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Consider the arrangement Arr(Q) of the set of curves in X = {^(n,bi),(r 2 ,b 2 ) I r i : r 2 £ P s ,61,62 £ Qt}- 
Since there are altogether 0(m 4 ) x O(m 2 ) = O (m 6 ) segments in A", we have that the arrangement Arr(ll) 
has 0 (m 12 ) complexity; that is, there are 0 (m 12 ) number of vertices, edges and polygonal cells. However, 
this arrangement Arr(Q) is more refined than necessary. Specifically, within a single cell c 6 Arr(fi), the 
entire bottleneck matching C* does not change. By a much more sophisticated argument, we can prove the 
following: 

Proposition 23 There is a planar decomposition A (ft) of the s-t domain f i with () (rtf) number of vertices, 
edges and polygonal cells such that as (s,t) varies within in each cell c £ A(Tl), the pair of red-blue 
persistence points that generates the bottleneck pair (r*, If ) remains the same. 

Furthermore, the decomposition A(H), as well as the bottleneck pair (r* . b*) associated to each cell, can 
be computed in 0(m 9 ' 5 logm) time. 

Proof: First, consider the decomposition of O into maximal cells within each of which the bottleneck pair 
does not change its identity. We refer to each such cell as a fixed-bottleneck-pair cell. Consider such a 
cell c and assume that within this cell c the bottleneck pair is (r*, b *) = (77 . b\ ). The boudary of c is a 
polygonal curve 7 , each linear segment of which corresponds to potential critical (s,t)-values where the 
red-blue pair (n, 61) has equal distance with some other red-blue pair, say (77, 62)- Each vertex, say v in this 
boundary curve 7 is where two segments meet, say one corresponding to ( 77 , 61) and ( 77 , bf), and the other 
corresponding to ( 77 , 61) and (73, 63). 

The vertices in 7 are of two types: (Type-1): (r 2 , bo) = ( 77 , 63 ) where v is also a vertex in a polygonal 
curve from Aq ri 77 t ( r2 ,bf’ (Type-2): remaining case where v = (so, A) represents the moment the red-blue 
pair ( 77 , 61 ) has distance equal to that of the two other red-blue pairs: ( 77 , 62 ) and (r 3 , 63 ). 

Intuitively, the segment of the boundary curve 7 , where ( 77 , 61 ) and ( 7 - 2 , 62 ) have equal distances, 
represents the boundary between two cells c and c' such that (i) the bottleneck pair is ( 77 ,61) when (s, t) £ c, 
(ii) as we cross this curve, the bottleneck pair may change to (7-2, 62) when (s, t) £ d , and (iii) along this 
curve, there is ambiguity and the bottleneck matching can be either ( 77 , 61) or (7-2, 62). 

Type -1 vertices are vertices from the same polygonal curve of where ( 77 ,61) and (7-2, 62) are at equal 
distances. Type -2 vertices are where this curve meets another curve representing the (s,t)-values where 
( 77 , 61) and ( 7 - 3 , 63) are at equal distances. Hence Type -2 vertices represent the places where three fixed- 
bottleneck-pair cells meet. 

By Corollary 22 , there are ()(rn 4 ) x 0(m 2 ) = O (m (> ) number of Type -1 vertices. We now show that the 
number of Type -2 vertices is 0[m 8 ) and we can compute all Type -2 vertices in 0(m 9 ' 5 log m) time. 

Note that each Type -2 vertex is induced by three red points (t™i , 7^2,7*3) and three blue points (61, 62, 63). 
First, enumerate all 0(m 6 ) possible triples of red-blue pairs. For each triple (77,61), (7-2, 62) and (7-3, 63), 
consider the graphs of functions D rii } )1 , D r2 ^ 2 , and D r 3 .7. The intersection of all three graphs are a super-set 
for Type -2 vertices generated by (77, 61), (77, 62) and (7-3, 63). It follows from Proposition 21 that there are 
0(m 2 ) intersection points of the three graphs - Specifically, we overlay the three O(m) x 0(m ) grids as 
specified by Proposition 21 , and within each cell of the resulting grid which is still of size 0{m) x 0(m), each 
function D Ti ^ i has 0 ( 1 ) complexity, and thus they produce 0 ( 1 ) intersection points. For each intersection 
point, we spend 0 (t7i 1,5 log rn) time using the modified algorithm of [ 13 ] to compute its bottleneck matching, 
and check whether this is a valid Type -2 vertex or not. Altogether, since there are 0(m 6 ) triples we need to 
check, there can be 0(m 8 ) Type -2 vertices, and they can be identified in 0 (t? 7 9 ' 5 logm) time. Fet E denote 
the resulting set of Type -2 vertices. 

With E computed, we next construct the decomposition A(Q) of Q into fixed-bottleneck-pair cells. To do 
this, we simply scan all vertices in E from left to right. For each vertex v £ E corresponding to the three 
red-blue pairs ( 77 , 61), (7-2, 62) and ( 77 , 63), we know that locally, there are three branches from v: one from 
4 , h i .r 2 .b ’2 , one from -Xj-i ,6^73,63 a °d one from X V2 j , 2 r: , j K ,. We simply trace each such curve till we meet 
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another vertex from £. Now consider the graph whose nodes are Type-2 vertices, and arcs are polygonal 
curves connecting them. We can use any graph traversal strategy (such as BFS) to traverse all arcs and thus 
connecting nodes. The total time is 

OfTime to traverse the graph) + OfTime to trace out all arcs). 

Since this graph is planar with 0(m 8 ) vertices, there are 0(m 8 ) arcs as well. Hence OfTime to traverse 
the graph) = 0(m 8 ). The time to trace an arc from one Type-2 vertex to the other is proportional to the 
complexity of this polygonal curve. We charge this time to the number of interior Type-1 vertices in this arc, 
as well as the two boundary Type-2 vertices of this arc. By Corollary 22, the polygonal curves from X has 
Ofm 2 ) x Ofm 4 ) = 0 (m 6 ) total complexity. Hence the total time to trace out all arcs is also bounded by 
Ofm 8 ). Putting everything together, we have that, once the Type-2 vertices are computed, we can construct 
A(H) in time 0(m 8 ). This completes the proof. ■ 

Our goal is to compute the bottleneck distance function F : Q —> 1R introduced at the beginning of 
this subsection where F(s, t ) i->- ds( P s ( s ), Qt(t)) = doo(r*(s ), b*(t)), so as to further compute persistence- 
distortion distance using Eqn (9). To do this, we need to further refine the decomposition A(fl) from 
Proposition 23 to another decomposition A(f2) as described below so that within each cell, the bottleneck 
distance function F (Jg ^ t can be described by a single linear function. 

Theorem 24 For a fixed pair of edges a s E Gi and at E G2, there is a planar polygonal decomposition 
A(Q) of the s-t domain Q ofO(m 10 ) complexity such that within each cell, the bottleneck distance function 
Fas,<n /v linear- Furthermore, one can compute this decomposition A (fi) as well as the function F rj , rSl in 
Ofm 10 logm) time. 

Proof: By Proposition 23, given any cell c E A(fi), the bottleneck pair (r*. b *) remains the same. In other 
words, let r c = r* and b c = b* for any (s, t) E c. We have F(s, t ) = doo(r c (s),b c (t))(= D rc: b c (s, t)) for 
(s, t) E c. Let A rrJ)<: (Q) be the decomposition of <> such that within each cell of .4 rV: ;, r: , the function D r b c 
is a linear function. By Proposition 21, A r . c y r consists of Ofm 2 ) cells, edges and vertices. Hence we can 
further decompose (refine) the cell c to be the intersection of c with A rc j >c - We perform this refinement 
for each cell c E A(fi), and denote the resulting decomposition as A(Q). By construction, the bottleneck 
distance F within each cell of A(<2) is a single linear function. 

Next, we bound the complexity of A(f2). First, note that the number of newly added vertices within the 
interior of a cell c E Affl) is bounded from above by Ofm 2 ), since each such vertex is a vertex from A rr j t<: . 
While there can be Ofm 8 ) number of cells in A(Q). there can only be Ofm 2 ) choices of bottleneck pairs 
(r*, b*)s. Hence the total number of vertices in the interior cells in A(H) is 0(?n 4 ). 

What remains is to bound the number of vertices along edges of A (Q). To this end, notice that each 
edge e E A(fi) has two incident cells ci and C2. Any newly added vertex in e must be either an intersection 
between e with some edge in A r b ci > or with some edge in A rc2 ^ C2 ■ Hence the total number of such vertices 
on e is 0(m 2 ). Since there are 0(m 8 ) edges in A(< >). the total number of newly added vertices is at most 
0(m 10 ). Thus the complexity of A(J2) is 0(m 10 ). 

Finally, the refined decomposition A(fi) can be computed in ()(rn U] log rn) time. Specifically, first, 
it takes 0(m 9 5 logm) time to compute A(Q) by Proposition 23. Next, for each cell c with k number of 
boundary edges, it takes Of (A: + m 2 ) log rn) time to compute the intersection c n A rr j, r . Summing over all 
cells in A(Q) gives the claimed time complexity. ■ 

5.2.2 Final algorithm and analysis. 
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We now aim to compute dji(C. F) using Eqn (9). First, for 
a fixed edge a s E Gi, consider the following lower-envelop 
function 


C : [0, L s \ —> 1R where C(s) 


i —> min min F(s,t), (10) 
o-t 6G 2 te[o,L t ] 


where recall L s and L t denote the length of edge a s and a t 
respectively. The reason behind the name “lower-envelop func¬ 
tion” will become clear shortly. 

Now for each at E G 2 , consider the polygonal decompo¬ 
sition A(fl) as described in Theorem 24. Since within each cell 
the bottleneck distance function F is a linear piece, we know 
that for any s, the extreme of F(s, t ) for all possible t E [0, Lt] 



Figure 6: (a) s-t domains for a s E E\ and 


edges of E E 2 . (b) C(s) is the lowerest 
value along any F ei . 

must come from some edge in A(fi). In other words, to compute the function min te r 0j £ t ] F(s, t ) at any 


s E [0, L s \, we only need to inspect the function F restricted to edges in the refined decomposition A(Q agjrrt ) 
for the s-t domain Ll (Tg ^ t = [0, L s ] x [0, L t \. Take any edge e of A(f2 CTsi(Tt ), define 7r e : [0, L s \ —> [0, L t ] 
such that (s, vr e (s)) E e. Now denote by the function F e : [0, L s ] —> IR as the projection of F onto the first 
parameter [0, L s \\ that is, F e (s) := F(s,ir e (s)). Let E (7g := {e E A(f2 CTBi<rt ) | at G G 2 } be the union of 
edges from the refined decompositions of the s-t domain formed by a s and any edge a t from G 2 . It is easy to 
see that (see Figure 6): 


C(s) = min F e (s ); that is, C is the lower-envelop of linear functions F e for all e E E (Jg . 

e&E ag 

There are 0{m) edges in G 2 , thus by Theorem 24 we have \E Cg \ = 0(m n ). The lower envelop C of 
\E (Jg | number of linear functions (linear segments), is a piecewise-linear function with 0(\ E rJs \ = ()(m l 1 ) 
complexity and can be computed in 0(\E ag \ log \E ag |) = 0(riS 1 log rn) time. Finally, from Eqn (9), 
dn{C,F) = rnaXfj^Gi m a x sG[o,L s l S(s). Since there are (){rn) choices for a s , we conclude with the 
following main result. 


Theorem 25 Given two metric graphs (Gi, da ,) and (G 2 , d (;. 2 ) with n total vertices and m total edges, we 
can compute the persistence-distortion distance dpD(Gi, G 2 ) between them in 0(m 12 logn) time. 

We remark that if both input graphs are metric trees, then we can compute their persistence-distortion 
distance more efficiently in 0(n 8 logn) time. 


6 Preliminary Experiments 

We show two sets of preliminary experimental results. The first experiment aims to demonstrate the stability 
of the proposed persistence-distortion distance, by showing that the persistence-distortion distance between 
a graph and a noisy sample of it remains stable w.r.t. the noise added. In the second experiment, we apply 
our persistence-distortion distance to compare a set of surface models, using simply the 1-skeleton of their 
mesh models, and show that this distance is robust again non-rigid but near-isometric deformations (such as 
different poses between humans, or between wolfs and horses), while it still can differentiate different models. 
In both experiments, to improve the efficiency, we only compute the persistence diagrams to a subset of graph 
nodes of input graphs, and obtain an even coarser version of the discrete persistence-distortion distance for 
input graphs. 

We also point out that in our experiments, we compute the 0-th zigzag persistence diagram for each 
basepoint. However, we observe little difference in results if only the 0-th standard persistence diagrams are 
used. 
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(a) 


(b) 




(d) 


Figure 7: (a) Hidden graph (Athens road map) G. (b) and (c) noisy 1-skeleton of Rips complex RJ' for 
r = 0.1 and r = 0.13 respectively, (d) The growth of the persistence-distortion distance dpi) {R\. G ) w.r.t. 
the parameter r in the Rips complex R r . Note that the noise level is e = The vertical range (two 
triangle-points) shows the max and min dpc values for 10 different re-samples (with noise) (i.e, for each 
noise level, we take 10 sets of samples) - the middle curve is the average dpp values of these 10 sets. 


Experiment 1. The first experiment aims to demonstrate the stability of the proposed persistence-distortion 
distance. Specifically, we consider a set of noisy points P sampled from a hidden graph G = ( V. E ), and 
compute the the Rips complex R r (P) of P as an approximation of the hidden graph G. The hidden graph G 
taken in this case is a part of the Athens road network. We obtain a noisy sample P e by perturbing each point 
from a discrete sample of G uniformly randomly within error e. We then build a Rips complex lZ r ( e \P £ ) 
with parameter r{e) = We then treat the 1-skeleton R’^ ] of lZ r ^ £ \P e ) as a metric graph, and this metric 
graph offers a noisy approximation of the hidden graph G. Examples of G and R ,\''' ] s are shown in 
Figure 7 (a), (b) and (c). 

To speedup the computation, we compute only the persistence diagrams at a set of 5-sparse subsamples 
Q C V and Q' C P £ , and only use points in Q and Q' as basepoints. Specifically, Q is obtained by the 
following randomized procedure: Take a random permutation of V. Process each node v r £ V in this order. 
We add Vj into 0 only if its distance to cuixent points in Q is larger than 5. Q' is obtained from P £ in a similar 
manner. We use a random order so as to further demonstrate the robustness against different discretization. 
By Theorem 3, one can show that this incurs at most 125 error in the estimation of dpr)(77 l p \ G). In our 
experiments, the size of the subsmapled sets Q and Q' are usually between 150 and 200 points. The time 
required for computing the discrete persistence-distortion distance using Q and Q' as basepoints, is observed 
to be from 20 ~ 30 seconds. 

In Figure 7 (d), we show the growth of the persistence-distortion distance dpD(7^ e \ G) with respect 
to the change of the noise level e; recall that the parameter r(e) = . We note that <1 pd(77| , G) grows 

roughly proportionally to the noise level, demonstrating its stability. We note that there is a small jump 
of dpD(7^i, G) from r(e) = 0.02 to r(e) = 0.025. This is because when r(e) increases, small loops (1st 
homology features) get created in the top-right part of the graph (from Figure 7 (b) to (c)). This shows that 
our persistence-distortion distance captures such small topological changes. 
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A. double torus(6734) 


B. kitten(8251) 


C. genus-3(4670) 


D. botijo(8770) 




h 


ft 


| E. armadillo(19096) 

ill 


F. humanl(12500) 


G. human2(12500) 


H. human3(12500) 


M. horse2(19248) 


N. horse3(19248) 


0 . wolfl(4344) 


P. wolf2(4344) 


I. centaurl(15768) 


J. centaur2(15768) 


K. centaur3(15768) 


L. horsel(19248) 


Figure 8: Models used for comparison. 
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P 

A 

0.0265 

0.2402 

0.2905 

0.2653 

0.3038 

0.3673 

0.3601 

0.3400 

0.2706 

0.2533 

0.2430 

0.2425 

0.2397 

0.2492 

0.2435 

0.2414 

B 

0.2402 

0.0416 

0.3235 

0.2652 

0.2643 

0.2739 

0.2695 

0.2722 

0.2697 

0.2500 

0.2360 

0.2387 

0.2349 

0.2425 

0.2405 

0.2404 

C 

0.2905 

0.3235 

0.0417 

0.2619 

0.4011 

0.3985 

0.3928 

0.3921 

0.4013 

0.4013 

0.3892 

0.3851 

0.3928 

0.3924 

0.3947 

0.3995 

D 

0.2653 

0.2652 

0.2619 

0.0374 

0.2761 

0.2750 

0.2769 

0.2794 

0.2775 

0.2767 

0.2789 

0.2802 

0.2782 

0.2792 

0.2806 

0.2811 

E 

0.3038 

0.2643 

0.4011 

0.2761 

0.0468 

0.1727 

0.2033 

0.2015 
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0.1490 

0.1188 

0.1133 

0.1137 

0.1253 
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Figure 9: Pairwise persistence-distortion distances between models. 
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Experiment 2. In the second experiment, we apply our persistence-distortion distance to compare surface 
meshes of different geometric models, some of which are different poses of the same object. The set of 
surface models are shown in Figure 8. For each surface model, we take the 1-skeleton Ki = (Vj. E)) of its 
surface mesh as input. Like in the first experiment, we also compute only the persistence diagrams at a set of 
5-sparse subsamples Q-, C Vj from input surface mesh constructed by a randomized decimation procedure. 
Again by Theorem 3, one can show that this incurs at most 125 error in the estimation of dpi) ( K,, Kj). 

Figure 9 shows the matrix of pairwise persistence-distortion distance between all pairs of models. Because 
the subsamples are generated by a randomized procedure, the resulting persistence-distortion distance for 
the same two graphs may be non-zero (as the basepoints chosen may be different). Nevertheless, note that 
distance values at the diagonal are usually small, implying that persistence-distortion is stable against different 
discretization of the same graph. 

From the matrix in Figure 9, we can see that models from the same group (such as humanl, human2 
and human3) have very small persistence-distortiondistances among them (darker colors for smaller values). 
Furthermore, models from similar groups (such as between wolves and horses) have persistence-distortion 
distances smaller than those between dissimilar groups (such as between wolves and double-torus). This 
demonstrates that our persistence-distortion distance is a reasonable measure for differentiating surface 
models. 

The number of vertices of an input mesh for each model is shown in brackets in Figure 8 after the model 
name. The size of the subsample of a graph is usually kept between 200 and 300. The time for computing 
the persistence-distortiondistance is typically less than 10 seconds. For the exceptional case involving two 
armadillos where the input graphs have large sizes, the running time is around 20 seconds. 

We also remark that it is possible to take simply the 1-skeleton of the Rips complex constructed from 
the point samples Vi of a surface mesh instead of using the surface mesh itself. We expect to obtain similar 
results though the complex size will most likely be larger. 

7 Conclusions and Future Directions 

In this paper, we proposed a new way to measure distance between metric graphs, called the persistence- 
distortion distance. This distance is developed based on a topological idea, and provides a new angle to the 
metric graph comparison problem. The proposed persistence-distortion distance is stable with respect to 
metric distortion, and align the underlying space of input graphs (instead of just graph nodes). Despite that 
we consider all points in input graphs, we present a polynomial time algorithm to compute the persistence- 
distortion distance. 

The time complexity for computing the (continuous) persistence-distortion distance is high. A worthwhile 
endeavor will be to bring it down with more accurate analysis. In particular, the geodesic distance function 
(to a basepoint) in the graph has many special properties, some of which we already leverage. It will be 
interesting to see whether we can further leverage these properties to reduce the bound on the decomposition 
A(f2) as used in Theorem 24. Developing efficient approximation algorithms for computing the persistence- 
distortion distance is also an interesting question. Also, the special case of metric trees is worthwhile to 
investigate. Notice that even discrete tree matching is still a hard problem for unlabeled trees, i.e, when no 
correspondences between tree nodes are given. 
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